Shape Generation
Michelangelo: Conditional 3D Shape Generation based on Shape-Image-Text Aligned Latent Representation

Neural Information Processing Systems

We present a novel alignment-before-generation approach to tackle the challenging task of generating general 3D shapes based on 2D images or texts. Directly learning a conditional generative model from images or texts to 3D shapes is prone to producing results inconsistent with the conditions, because 3D shapes have an additional dimension whose distribution differs significantly from that of 2D images and texts. To bridge the domain gap among the three modalities and facilitate multi-modal-conditioned 3D shape generation, we explore representing 3D shapes in a shape-image-text-aligned space. Our framework comprises two models: a Shape-Image-Text-Aligned Variational Auto-Encoder (SITA-VAE) and a conditional Aligned Shape Latent Diffusion Model (ASLDM). The former encodes 3D shapes into a shape latent space aligned to the image and text spaces and reconstructs the fine-grained 3D neural fields corresponding to given shape embeddings via a transformer-based decoder. The latter learns a probabilistic mapping function from the image or text space to the latent shape space. Our extensive experiments demonstrate that our proposed approach can generate higher-quality and more diverse 3D shapes that conform better semantically to the visual or textual conditional inputs, validating the effectiveness of the shape-image-text-aligned space for cross-modality 3D shape generation.
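The core of the alignment-before-generation idea is training shape embeddings to line up with paired image/text embeddings before any generation happens. A minimal, stdlib-only sketch of such an objective is a symmetric InfoNCE-style contrastive loss over a batch of paired embeddings; the function names are illustrative and not taken from the paper's code.

```python
import math

def cosine(u, v):
    # Cosine similarity between two vectors.
    dot = sum(a * b for a, b in zip(u, v))
    nu = math.sqrt(sum(a * a for a in u))
    nv = math.sqrt(sum(b * b for b in v))
    return dot / (nu * nv)

def contrastive_alignment_loss(shape_emb, other_emb, temperature=0.07):
    """Symmetric InfoNCE-style loss: pull each shape embedding toward its
    paired image/text embedding and push it away from other pairs in the batch."""
    n = len(shape_emb)
    # Temperature-scaled similarity matrix between the two modalities.
    sims = [[cosine(s, o) / temperature for o in other_emb] for s in shape_emb]
    loss = 0.0
    for i in range(n):
        # shape -> image/text direction: row i, positive on the diagonal.
        row = sims[i]
        loss += -row[i] + math.log(sum(math.exp(x) for x in row))
        # image/text -> shape direction: column i, positive on the diagonal.
        col = [sims[j][i] for j in range(n)]
        loss += -col[i] + math.log(sum(math.exp(x) for x in col))
    return loss / (2 * n)
```

With correctly matched pairs the loss is near zero; shuffling the pairing drives it up, which is what a diffusion model in the aligned latent space benefits from.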


DiT-3D: Exploring Plain Diffusion Transformers for 3D Shape Generation

Neural Information Processing Systems

Recent Diffusion Transformers (i.e., DiT) have demonstrated their effectiveness in generating high-quality 2D images. However, it is unclear whether the Transformer architecture performs equally well in 3D shape generation, as previous 3D diffusion methods mostly adopted the U-Net architecture. To bridge this gap, we propose a novel Diffusion Transformer for 3D shape generation, named DiT-3D, which can directly operate the denoising process on voxelized point clouds using plain Transformers. Compared to existing U-Net approaches, our DiT-3D is more scalable in model size and produces much higher-quality generations. Specifically, DiT-3D adopts the design philosophy of DiT but modifies it by incorporating 3D positional and patch embeddings to aggregate input from voxelized point clouds. To reduce the computational cost of self-attention in 3D shape generation, we incorporate 3D window attention into the Transformer blocks, as the increased 3D token length resulting from the additional voxel dimension can lead to high computation. Finally, linear and devoxelization layers are used to predict the denoised point clouds. In addition, we empirically observe that a DiT-2D checkpoint pre-trained on ImageNet can significantly improve DiT-3D on ShapeNet. Experimental results on the ShapeNet dataset demonstrate that the proposed DiT-3D achieves state-of-the-art performance in high-fidelity and diverse 3D point cloud generation.
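The token-length problem the abstract describes can be made concrete with a little arithmetic: 3D patchification of a V³ voxel grid with patch size p yields (V/p)³ tokens, and non-overlapping 3D window attention cuts the quadratic attention cost to a per-window cost. A small sketch under those assumptions (function names are illustrative, not from the paper):

```python
def token_grid_side(voxel_res, patch_size):
    """Side length of the token grid after 3D patch embedding of a
    voxel_res^3 grid; total tokens are this value cubed."""
    assert voxel_res % patch_size == 0
    return voxel_res // patch_size

def full_attention_cost(g):
    # Plain self-attention over all g^3 tokens: quadratic pairwise interactions.
    t = g ** 3
    return t * t

def window_attention_cost(g, w):
    """Non-overlapping 3D windows of side w: each of the (g/w)^3 windows
    runs full attention only over its own w^3 tokens."""
    assert g % w == 0
    num_windows = (g // w) ** 3
    return num_windows * (w ** 3) ** 2
```

For example, a 32³ voxel grid with patch size 4 gives an 8³ = 512-token sequence; windows of side 2 reduce the pairwise-interaction count by a factor of 64, which is the motivation for 3D window attention in the blocks.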


Learning elementary structures for 3D shape generation and matching

Neural Information Processing Systems

We propose to represent shapes as the deformation and combination of learnt elementary 3D structures. We demonstrate that this decomposition into learnt elementary 3D structures is highly interpretable and leads to clear improvements in 3D shape generation and matching. More precisely, we present two complementary approaches to learning elementary structures in a deep learning framework: (i) continuous surface deformation learning and (ii) 3D structure points learning. Both approaches can be extended to abstract structures of higher dimensions for improved results. We evaluate our method on two very different tasks: ShapeNet object reconstruction and dense correspondence estimation between human scans. Qualitatively, our approach provides interpretable and repeatable results. Quantitatively, we show a substantial 16% boost for 3D object generation via surface deformation, as well as a clear 6% improvement over state-of-the-art correspondence results on the FAUST inter challenge.
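The "deformation and combination" idea can be sketched in a few lines: a shape is assembled as the union of several elementary point structures, each passed through its own learned transform. Here the transforms are simple affine maps for illustration; the paper's learned deformations are neural networks, so this is a toy sketch, not the authors' method.

```python
def apply_affine(points, matrix, translation):
    """Apply a 3x3 linear map plus a translation to a list of 3D points."""
    out = []
    for p in points:
        q = [sum(matrix[r][c] * p[c] for c in range(3)) + translation[r]
             for r in range(3)]
        out.append(q)
    return out

def assemble_shape(elementary_structures, transforms):
    """Reconstruct a shape as the combination of elementary 3D point
    structures, each deformed by its own (matrix, translation) pair."""
    shape = []
    for pts, (mat, t) in zip(elementary_structures, transforms):
        shape.extend(apply_affine(pts, mat, t))
    return shape
```

Because every output shape reuses the same elementary structures, corresponding points across shapes come from the same structure, which is what makes the decomposition useful for dense matching.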


3DILG: Irregular Latent Grids for 3D Generative Modeling

Neural Information Processing Systems

We propose a new representation for encoding 3D shapes as neural fields. The representation is designed to be compatible with the transformer architecture and to benefit both shape reconstruction and shape generation. Existing works on neural fields are grid-based representations with latents defined on a regular grid. In contrast, we define latents on irregular grids, which allows our representation to be sparse and adaptive. In the context of shape reconstruction from point clouds, our shape representation built on irregular grids improves upon grid-based methods in terms of reconstruction accuracy.
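The contrast with regular grids can be illustrated by how the irregular latent positions might be chosen: instead of fixing latents at every cell of a dense grid, one attaches latents to a sparse, well-spread subset of the input point cloud, for example via farthest point sampling. A minimal stdlib sketch (the selection strategy here is a common choice, not necessarily the paper's exact procedure):

```python
def sq_dist(a, b):
    # Squared Euclidean distance between two 3D points.
    return sum((x - y) ** 2 for x, y in zip(a, b))

def farthest_point_sample(points, k):
    """Greedily pick k well-spread anchor points from a point cloud.
    Latents can then live at these irregular, shape-adaptive positions
    instead of on a dense regular grid."""
    chosen = [0]  # seed with the first point
    d = [sq_dist(points[0], p) for p in points]
    while len(chosen) < k:
        # Next anchor: the point farthest from all anchors chosen so far.
        idx = max(range(len(points)), key=lambda i: d[i])
        chosen.append(idx)
        d = [min(d[i], sq_dist(points[idx], points[i]))
             for i in range(len(points))]
    return [points[i] for i in chosen]
```

Because the anchors follow the surface, the latent budget concentrates where the shape actually is, which is the sparsity and adaptivity advantage over a regular grid.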